Synthesising Uncertainty: The Interplay of Vocal Effort and Hesitation Disfluencies
Abstract
As synthetic voices become more flexible and conversational systems gain more potential to adapt to the environmental and social situation, it becomes necessary to examine how different modifications to synthetic speech interact with each other and how their specific combinations influence perception. This work investigates how the vocal effort of synthetic speech, together with added disfluencies, affects listeners’ perception of the degree of uncertainty in an utterance. We introduce a DNN voice built entirely from spontaneous conversational speech data and capable of producing a continuum of vocal efforts, prolongations and filled pauses with a corpus-based method. Results of a listener evaluation indicate that decreased vocal effort, filled pauses and prolongation of function words increase the degree of perceived uncertainty of conversational utterances expressing the speaker’s beliefs. We demonstrate that the effects of these three cues are not merely additive, but that interaction effects, in particular between the two types of disfluencies and between vocal effort and prolongations, need to be considered when aiming to communicate a specific level of uncertainty. These findings are relevant for adaptive and incremental conversational systems that use expressive speech synthesis and aspire to communicate the attitude of uncertainty.
Similar resources
Do social anxiety individuals hesitate more? The prosodic profile of hesitation disfluencies in Social Anxiety Disorder individuals
Building on psychologists' observations that individuals with Social Anxiety Disorder (SAD) speak slower and more quietly, this study examines to what extent the characteristics of hesitation disfluencies and silent pauses distinguish between SAD and control participants. Participants responded verbally to six identical questions, and their responses were recorded and analyzed. Our first observ...
Phonological Aspects of Hesitation Disfluencies
An effective approach to the study of prosody in spoken language seeks to identify prosodic patterns and their communicative values, and to subsequently find a correlation between these prosodic patterns and other layers of linguistic structure. The present research strives to define a single prosodic boundary pattern: the boundary tone of hesitation disfluencies in spontaneous Israeli Hebrew. ...
Hesitation disfluencies in spontaneous speech:
Human speech is peppered with ums and uhs, among other signs of hesitation in the planning process. But are these so-called fillers (or filled pauses) intentionally uttered by speakers, or are they side-effects of difficulties in the planning process? And how do listeners respond to them? In the present paper we review evidence concerning the production and comprehension of fillers such as um a...
The linguistic role of hesitation disfluencies: evidence from Hebrew and Japanese
In this paper we examine a certain aspect of the prosody-syntax interface, that of hesitation disfluencies (HD) that occur intra-phrases or intra-morphemes. Such cases were found in two spontaneous corpora of two syntactically distinct languages – Israeli Hebrew (IH) and Japanese. It was found that intra-phrasal hesitations in the two languages call for different explanations, since in Japanese the...
Can You Hear These Mid-front Vowels? Formants Analysis of Hesitation Disfluencies in Spontaneous Hebrew
This study attempts to characterize the timbre of the default type of hesitation disfluency (HD) in Israeli Hebrew: the mid-front vowel /e/. For this purpose, we analysed the frequencies of the first three formants, F1, F2, and F3, of hundreds of HD pronunciations taken from The Corpus of Spoken Israeli Hebrew (COSIH). We also compared the formant values with two former studies that were carrie...